Gas Data
Reading in Data
We first read in two data sets called “gas” representing several
different values having to due with gas stations in the United States.
“Gas” has 72798 observations and 32 total variables.
gas <- read.csv("https://ecoleman451.github.io/website/Data%20Visualization/Datasets/POC.csv")
Mapping
Simple Leaflet
Map
In the following map, each point represents the latitude and
longitude values for a specific gas station (represented by “xcoord” and
“ycoord” respectively in the “gas” data set). Hovering over each point,
you will see the “State”, “County”, “Address”, and “Zip Code” for that
specific gas station. The results are below:
gas_samp <- gas %>% sample_n(500)
# Create a leaflet map
gas_map <- leaflet(data = gas_samp) %>%
addTiles() %>%
addMarkers(
lng = ~xcoord,
lat = ~ycoord,
popup = ~paste("State: ", STATE, "<br>",
"County: ", county, "<br>",
"Address: ", ADDRESS, "<br>",
"Zip Code: ", ZIPnew)
)
# Display the map
gas_map
Leaflet Map
Below, we create a slightly more better leaflet map by using the
“radius” and “color” functions. Here, we have the radius fixed at 5, the
color fixed at blue, and the markers as circle points instead of
location markers. The map is similar to the one above in which each
circle point represents a gas station. Hovering over a point will once
again give the “State”, “County”, “Address”, and “Zip Code” for that
specific gas station.
# Create a leaflet map
gas_map2 <- leaflet(data = gas_samp) %>%
addTiles() %>%
setView(lng = mean(gas_samp$xcoord),
lat = mean(gas_samp$ycoord),
zoom = 13) %>%
addProviderTiles("Esri.WorldGrayCanvas") %>%
addCircleMarkers(
~xcoord,
~ycoord,
color = "blue", # Adjust color as needed
radius = 5, # Adjust radius as needed
stroke = FALSE,
fillOpacity = 0.4,
label = ~paste("State: ", STATE,
"County: ", county,
"Address: ", ADDRESS,
"Zip Code: ", ZIPnew)
) %>%
addLegend(position = "bottomright",
colors = "blue", # Adjust color as needed
labels = "Gas Station",
title = "Gas Stations",
opacity = 0.4)
# Display the map
gas_map2
Best Map
This time, we create a slightly more complex leaflet map by using the
“radius” and “color” functions once again. Here, we have the radius
dependent on the number of POCs (points of compromise) in a gas
station’s respective ZIP Code. This means that the larger a point
appears on the map, the more POCs exist in that station’s ZIP Code. The
color is dependent on whether or not a gas station provides “Fuel”,
“Service Only” or “Both” fuel and services. So, each of those three
categories will have a specific color, representing the services
provided at a specific gas station. In the data, we do not get any gas
stations that provide “Service Only”, so only the categories “Fuel” and
“Both” will appear on the map. The map is similar to the previous maps
above in which each circle point represents a gas station. Hovering over
a point will once again give the “State”, “County”, “Address”, and “Zip
Code” for that specific gas station.
# Create a color palette based on service_or_fuel values
service_palette <- colorFactor(palette = "Set1", domain = gas_samp$service_or_fuel)
# Create the leaflet map
gas_map3 <- leaflet(data = gas_samp) %>%
addTiles() %>%
addProviderTiles("Esri.WorldGrayCanvas") %>%
addCircleMarkers(
~xcoord,
~ycoord,
color = ~service_palette(service_or_fuel), # Use colorFactor
radius = gas_samp$ZIPPOC * 10, # Adjust radius as needed
stroke = FALSE,
fillOpacity = 0.4,
label = ~paste("State: ", STATE, "<br>",
"County: ", county, "<br>",
"Address: ", ADDRESS, "<br>",
"Zip Code: ", ZIPnew)
) %>%
addLegend(position = "bottomright",
colors = service_palette(unique(gas_samp$service_or_fuel)), # Use unique service_or_fuel values
labels = unique(gas_samp$service_or_fuel),
title = "Gas Stations",
opacity = 0.4)
# Display the map
gas_map3
Philly Crime Data
Reading in Data
We first read in two data sets called “philly” representing several
different values having to due with crimes that have occurred in the
Philadelphia area since 2015. “Philly” has 15520 observations and 18
total variables.
philly <- read.csv("https://ecoleman451.github.io/website/Data%20Visualization/Datasets/PhillyCrimeSince2015.csv")
Mapping
Now, we can create a leaflet map looking at fatal versus non-fatal
crimes that occured in Philadelphia in the year 2023 by using the
“color” function once again. The color is dependent on whether or not a
crime was labeled as “Fatal” or “Nonfatal”. So, each category will have
a specific color, representing the type of crime (fatal vs. nonfatal)
that occurred at the time. The map is similar to the previous maps above
in which each circle point represents a specific crime. Hovering over a
point will give the “Neighborhood”, “Date”, “Race”, and “Sex”, “Age”,
and “Street” for that specific crime.
library(leaflet)
library(dplyr)
# Create color palette for fatal and non-fatal crimes
fatal <- "red"
non_fatal <- "blue"
# Create leaflet map
map <- leaflet(philly) %>%
addTiles() %>%
addCircleMarkers(
~lng, ~lat,
color = ifelse(philly$fatal == "Fatal", fatal, non_fatal),
radius = 5,
label = ~paste("Neighborhood: ", neighborhood,
"Date: ", date,
"Race: ", race,
"Sex: ", sex,
"Age: ", age,
"Street: ", street_name),
labelOptions = labelOptions(
direction = "auto"
)
) %>%
addLegend(
position = "bottomright",
colors = c(fatal, non_fatal),
labels = c("Fatal", "Non-Fatal"),
title = "Crime Type"
) %>%
addScaleBar() %>%
addControl(
html = "<h4>Philadelphia Crime Locations (2015-2024)</h4>",
position = "topright"
)
# Display the map
map
Eyeballing the map, we can see that there seems to be a significant
amount more of non-fatal crimes versus fatal crimes. Further analysis
would be needed to confirm this, of course.
Philly Crime Data
Reading in Data
We first read in a data set representing several different values
having to due with the city of Philadelphia. The data, called “Philly”,
contains information on Shootings that have occurred in the area and
categorizes them as either “Fatal” or “Non-Fatal”. “Philly” has 15555
observations and 21 total variables.
philly <- na.omit(st_read("https://pengdsci.github.io/STA553VIZ/w08/PhillyShootings.geojson"))
phillyNeighbor <- st_read("https://pengdsci.github.io/STA553VIZ/w08/Neighborhoods_Philadelphia.geojson")
Mapping
Now, we can create a leaflet map looking at fatal versus non-fatal
crimes that occured in Philadelphia by using the “color” function once
again. The color is dependent on whether or not a crime was labeled as
“Fatal” or “Nonfatal”. So, each category will have a specific color,
representing the type of crime (fatal vs. nonfatal) that occurred at the
time. Each crime location is represented with a circle marker. Hovering
over a point will display information for “Object ID”, “Year”, “Race”,
“Sex”, “Age”, “Wound”, and “Location” for each of the crime points.
# Load required libraries
library(leaflet)
library(sf)
# Convert 'philly' data to sf object
philly_sf <- st_as_sf(philly, coords = c("point_x", "point_y"), crs = 4326)
# Define color palette for fatal and non-fatal crimes
fatal_color <- "red"
non_fatal_color <- "gold"
# Create leaflet map
map <- leaflet() %>%
addProviderTiles(providers$Esri.WorldGrayCanvas) %>%
addPolygons(data = phillyNeighbor,
color = 'skyblue',
weight = 1) %>%
addCircleMarkers(data = philly_sf,
~point_x, ~point_y,
color = ifelse(philly$fatal == 1, fatal_color, non_fatal_color),
radius = 5,
popup = ~paste("Object ID: ", objectid,
"<br>Year: ", year,
"<br>Race: ", race,
"<br>Sex: ", sex,
"<br>Age: ", age,
"<br>Wound: ", wound,
"<br>Location: ", location),
labelOptions = labelOptions(
direction = "auto"
)
) %>%
addLegend(
position = "bottomright",
colors = c("red", "gold"),
labels = c("Fatal", "Non-Fatal"),
title = "Crime Type"
) %>%
addScaleBar() %>%
addControl(
html = "<h4>Philadelphia Crime Locations (2015-2024)</h4>",
position = "topright"
) %>%
addProviderTiles(providers$Esri.WorldGrayCanvas) %>%
setView(lng = -75.1527, lat = 39.9707, zoom = 11)
# Display the map
map
Working Data and Data
Loading
Our initial dataset contains Presidential election results from the
years 2000, 2004, 2008, 2012, 2016, and 2020. The data, called
“election”, has 72617 observations and 12 variables containing
information for every state, county, election, candidate winner in each
county, and total candidate votes.
Some data cleaning was necessary in order to fix the issue of the
county FIPS code, which represents the specific 5 digit code assigned to
every county in the United States. The data set initially had some codes
containing 4 digits instead of 4 if there contained a “0” as the first
digit. For example, Autauga, Alabama’s FIPS code “01001” is recorded as
“1001” in the set. This was able to be done through the excel function
“TEXT” and was done prior to reading the data in as the set called
“election”.
Using this dataset called “election”, we want to split the data so
that we have information focusing only from the county level and a
dataset for the state level. Both sets include a new variable called
“party_percentage” which calculates the percentage of people that picked
the winning party in the sets’ respective state/county - “county_data”
containing election results by county and “state_data” containing
election results by state. Both sets also keep only the winning party in
the dataset.
# Load the required library
library(dplyr)
# Read the data
election <- read.csv("https://ecoleman451.github.io/website/Data%20Visualization/Datasets/PresidentialElection2000To2020.csv")
# County-level Data
county_data <- election %>%
group_by(year, state, county_name) %>%
mutate(party_percentage = candidatevotes / sum(candidatevotes) * 100) %>%
filter(party_percentage == max(party_percentage)) %>%
select(year, state, county_fips, party, candidate, candidatevotes, party_percentage)
# State-level Data
state_data <- election %>%
group_by(year, state) %>%
mutate(party_percentage = candidatevotes / sum(candidatevotes) * 100) %>%
filter(party_percentage == max(party_percentage)) %>%
select(year, state, party, candidate, candidatevotes, party_percentage)
# Save county-level data to a new CSV file
write.csv(county_data, file = "county_level_data.csv", row.names = FALSE)
# Save state-level data to a new CSV file
write.csv(state_data, file = "state_level_data.csv", row.names = FALSE)
Choropleth Map
Now that we’ve created the data set called “county_data” - focusing
solely on election results (specifically the winning party) by county
level, we can create an interactive Choropleth Map using the data
visualization tool Tableau to display presidential election results at
the county level. There are two different colors assigned to represent
the major political parties (Democrat & Republican) and the map
shades in every county to reflect which political party was the winner
in a specific election year. A filter is set to change the year(s) in
the interactive map. Hover text is also included which appears when
hovering over a specific county on the map, containing information for
“year”, “state”, “party”, “candidatevotes”, & “party_percentage” for
the respective county being hovered over.
---
title: "Mapping"
author: "Edward Coleman"
date: "West Chester University"
output:
  html_document: 
    toc: yes
    toc_depth: 4
    toc_float: yes
    code_folding: hide
    code_download: yes
    smooth_scroll: yes
    number_sections: yes
    theme: readable
    editor_options:
      chunk_output_type: inline
---

<style type="text/css">
/* Common styles */
div#TOC li {
    list-style:none;
    background-color:lightgray;
    background-image:none;
    background-repeat:none;
    background-position:0;
    font-family: Arial, Helvetica, sans-serif;
    color: #780c0c;
}

h1.title {
    font-size: 24px;
    color: DarkRed;
    text-align: center;
    font-family: Arial, Helvetica, sans-serif;
    font-variant-caps: normal;
}
h4.author { 
    font-size: 18px;
    font-family: "Times New Roman", Times, serif;
    color: DarkRed;
    text-align: center;
}
h4.date { 
    font-size: 18px;
    font-family: "Times New Roman", Times, serif;
    color: DarkBlue;
    text-align: center;
}
h1 { 
    font-size: 22px;
    font-family: "Times New Roman", Times, serif;
    color: darkred;
    font-weight: bold;
    text-align: center;
}
h2 { 
    font-size: 18px;
    font-family: "Times New Roman", Times, serif;
    color: navy;
    text-align: left;
}
h3 { 
    font-size: 18px;
    font-family: "Times New Roman", Times, serif;
    color: navy;
    font-weight: bold;
    text-align: left;
}
h4 { 
    font-size: 18px;
    font-family: "Times New Roman", Times, serif;
    color: darkred;
    text-align: left;
}

/* Tab features */
.nav>li>a {
    position: relative;
    display: block;
    padding: 2px 15px;
    color: #990000;
}
.nav-pills>li.active>a, .nav-pills>li.active>a:hover, .nav-pills>li.active>a:focus {
    color: #ffffff;
    background-color: #990000;
}
/*
nav-pills>li:nth-child(2) {
    background: green;
 }
 */
img {
    border: 1px solid #555;
}
</style>

```{r setup, include=FALSE, comment=NA}
options(repos = list(CRAN="http://cran.rstudio.com/"))
if (!require("tidyverse")) {
   install.packages("tidyverse")
   library(tidyverse)
}
if (!require("knitr")) {
   install.packages("knitr")
   library(knitr)
}
if (!require("cowplot")) {
   install.packages("cowplot")
   library(cowplot)
}
if (!require("latex2exp")) {
   install.packages("latex2exp")
   library(latex2exp)
}
if (!require("plotly")) {
   install.packages("plotly")
   library(plotly)
}
if (!require("gapminder")) {
   install.packages("gapminder")
   library(gapminder)
}
if (!require("png")) {
    install.packages("png")    
    library("png")
}
if (!require("RCurl")) {
    install.packages("RCurl")    
    library("RCurl")
}
if (!require("colourpicker")) {
    install.packages("colourpicker")              
    library("colourpicker")
}
if (!require("gganimate")) {
    install.packages("gganimate")              
    library("gganimate")
}
if (!require("gifski")) {
    install.packages("gifski")              
    library("gifski")
}
if (!require("magick")) {
    install.packages("magick")              
    library("magick")
}
if (!require("grDevices")) {
    install.packages("grDevices")              
    library("grDevices")
}
if (!require("jpeg")) {
    install.packages("jpeg")              
    library("jpeg")
}
if (!require("ggridges")) {
    install.packages("ggridges")              
    library("ggridges")
}
if (!require("plyr")) {
    install.packages("plyr")              
    library("plyr")
}
if (!require("ggiraph")) {
    install.packages("ggiraph")              
    library("ggiraph")
}
if (!require("highcharter")) {
    install.packages("highcharter")              
    library("highcharter")
}
if (!require("forecast")) {
    install.packages("forecast")              
    library("forecast")
}
if (!require("leaflet")) {
    install.packages("leaflet")              
    library("leaflet")
}
if (!require("sf")) {
    install.packages("sf")              
    library("sf")
}
## 
knitr::opts_chunk$set(echo = TRUE,       
                      warning = FALSE,   
                      result = TRUE,   
                      message = FALSE,
                      comment = NA)
# code chunk specifies whether the R code, warnings, and output 
# will be included in the output files.
if (!require("Stat2Data")) {
   install.packages("Stat2Data")
   library(Stat2Data)
}
if (!require("knitr")) {
   install.packages("knitr")
   library(knitr)
}

knitr::opts_chunk$set(echo = TRUE,       
                      warning = FALSE,   
                      result = TRUE,   
                      message = FALSE,
                      comment = NA)
```

# Gas Data
## Reading in Data
  We first read in two data sets called "gas" representing several different values having to due with gas stations in the United States. "Gas" has 72798 observations and 32 total variables.
```{r, comment=NA}
gas <- read.csv("https://ecoleman451.github.io/website/Data%20Visualization/Datasets/POC.csv")
```

# Mapping
## Simple Leaflet Map
  In the following map, each point represents the latitude and longitude values for a specific gas station (represented by "xcoord" and "ycoord" respectively in the "gas" data set). Hovering over each point, you will see the "State", "County", "Address", and "Zip Code" for that specific gas station. The results are below:
```{r, comment=NA}
 gas_samp <- gas %>% sample_n(500)

# Create a leaflet map
gas_map <- leaflet(data = gas_samp) %>%
  addTiles() %>%
  addMarkers(
    lng = ~xcoord,
    lat = ~ycoord,
    popup = ~paste("State: ", STATE, "<br>",
                   "County: ", county, "<br>",
                   "Address: ", ADDRESS, "<br>",
                   "Zip Code: ", ZIPnew)
  )
# Display the map
gas_map
```

## Leaflet Map
  Below, we create a slightly more better leaflet map by using the "radius" and "color" functions. Here, we have the radius fixed at 5, the color fixed at blue, and the markers as circle points instead of location markers. The map is similar to the one above in which each circle point represents a gas station. Hovering over a point will once again give the "State", "County", "Address", and "Zip Code" for that specific gas station.
```{r, comment=NA}
# Create a leaflet map
gas_map2 <- leaflet(data = gas_samp) %>%
  addTiles() %>%
  setView(lng = mean(gas_samp$xcoord), 
          lat = mean(gas_samp$ycoord), 
          zoom = 13) %>%
  addProviderTiles("Esri.WorldGrayCanvas") %>%
  addCircleMarkers(
    ~xcoord, 
    ~ycoord,
    color = "blue",  # Adjust color as needed
    radius = 5,  # Adjust radius as needed
    stroke = FALSE, 
    fillOpacity = 0.4,
    label = ~paste("State: ", STATE,
                   "County: ", county,
                   "Address: ", ADDRESS,
                   "Zip Code: ", ZIPnew)
  ) %>%
  addLegend(position = "bottomright", 
            colors = "blue",  # Adjust color as needed
            labels = "Gas Station",
            title = "Gas Stations",
            opacity = 0.4)

# Display the map
gas_map2
```

## Best Map
 This time, we create a slightly more complex leaflet map by using the "radius" and "color" functions once again. Here, we have the radius dependent on the number of POCs (points of compromise) in a gas station's respective ZIP Code. This means that the larger a point appears on the map, the more POCs exist in that station's ZIP Code. The color is dependent on whether or not a gas station provides "Fuel", "Service Only" or "Both" fuel and services. So, each of those three categories will have a specific color, representing the services provided at a specific gas station. In the data, we do not get any gas stations that provide "Service Only", so only the categories "Fuel" and "Both" will appear on the map. The map is similar to the previous maps above in which each circle point represents a gas station. Hovering over a point will once again give the "State", "County", "Address", and "Zip Code" for that specific gas station.
```{r, comment=NA}
# Create a color palette based on service_or_fuel values
service_palette <- colorFactor(palette = "Set1", domain = gas_samp$service_or_fuel)

# Create the leaflet map
gas_map3 <- leaflet(data = gas_samp) %>%
  addTiles() %>%
  addProviderTiles("Esri.WorldGrayCanvas") %>%
  addCircleMarkers(
    ~xcoord, 
    ~ycoord,
    color = ~service_palette(service_or_fuel),  # Use colorFactor
    radius = gas_samp$ZIPPOC * 10,  # Adjust radius as needed
    stroke = FALSE, 
    fillOpacity = 0.4,
    label = ~paste("State: ", STATE, "<br>",
                   "County: ", county, "<br>",
                   "Address: ", ADDRESS, "<br>",
                   "Zip Code: ", ZIPnew)
  ) %>%
  addLegend(position = "bottomright", 
            colors = service_palette(unique(gas_samp$service_or_fuel)),  # Use unique service_or_fuel values
            labels = unique(gas_samp$service_or_fuel),
            title = "Gas Stations",
            opacity = 0.4)

# Display the map
gas_map3
```


# Philly Crime Data
## Reading in Data
  We first read in two data sets called "philly" representing several different values having to due with crimes that have occurred in the Philadelphia area since 2015. "Philly" has 15520 observations and 18 total variables.
```{r, comment=NA}
philly <- read.csv("https://ecoleman451.github.io/website/Data%20Visualization/Datasets/PhillyCrimeSince2015.csv")
```
## Convert Date Variable to Date Format and Subset
  We want to only have 2023 data in our final subset for our map. Since there doesn't exist a variable for year, we have to create one using the variable date that already exists in the dataset. Once we have our Year variable, we can easily subset the data to focus only on 2023 data. Now, our new data set has 1666 observations and 19 variables, now including Year.
```{r, comment = NA}
# Convert date variable to date format
philly$date <- as.Date(philly$date, format = "%m/%d/%Y %H:%M")

# Extract year from date variable
philly$year <- format(philly$date, "%Y")

philly <- subset(philly, year=="2023")
```

## Mapping
  Now, we can create a leaflet map looking at fatal versus non-fatal crimes that occured in Philadelphia in the year 2023 by using the "color" function once again. The color is dependent on whether or not a crime was labeled as "Fatal" or "Nonfatal". So, each category will have a specific color, representing the type of crime (fatal vs. nonfatal) that occurred at the time. The map is similar to the previous maps above in which each circle point represents a specific crime. Hovering over a point will give the "Neighborhood", "Date", "Race", and "Sex", "Age", and "Street" for that specific crime.
```{r, comment=NA}
library(leaflet)
library(dplyr)
# Create color palette for fatal and non-fatal crimes
fatal <- "red"
non_fatal <- "blue"

# Create leaflet map
map <- leaflet(philly) %>%
  addTiles() %>%
  addCircleMarkers(
    ~lng, ~lat,
    color = ifelse(philly$fatal == "Fatal", fatal, non_fatal),
    radius = 5,
    label = ~paste("Neighborhood: ", neighborhood,
                   "Date: ", date,
                   "Race: ", race,
                   "Sex: ", sex,
                   "Age: ", age,
                   "Street: ", street_name),
    labelOptions = labelOptions(
      direction = "auto"
    )
  ) %>%
  addLegend(
    position = "bottomright",
    colors = c(fatal, non_fatal),
    labels = c("Fatal", "Non-Fatal"),
    title = "Crime Type"
  ) %>%
  addScaleBar() %>%
  addControl(
    html = "<h4>Philadelphia Crime Locations (2015-2024)</h4>",
    position = "topright"
  )

# Display the map
map
```
  Eyeballing the map, we can see that there seems to be a significant amount more of non-fatal crimes versus fatal crimes. Further analysis would be needed to confirm this, of course.


# Philly Crime Data
## Reading in Data
  We first read in a data set representing several different values having to due with the city of Philadelphia. The data, called "Philly", contains information on Shootings that have occurred in the area and categorizes them as either "Fatal" or "Non-Fatal". "Philly" has 15555 observations and 21 total variables.
```{r, comment=NA, results=FALSE}
philly  <- na.omit(st_read("https://pengdsci.github.io/STA553VIZ/w08/PhillyShootings.geojson"))
phillyNeighbor  <- st_read("https://pengdsci.github.io/STA553VIZ/w08/Neighborhoods_Philadelphia.geojson")
```

## Mapping
  Now, we can create a leaflet map looking at fatal versus non-fatal crimes that occured in Philadelphia by using the "color" function once again. The color is dependent on whether or not a crime was labeled as "Fatal" or "Nonfatal". So, each category will have a specific color, representing the type of crime (fatal vs. nonfatal) that occurred at the time. Each crime location is represented with a circle marker. Hovering over a point will display information for "Object ID", "Year", "Race", "Sex", "Age", "Wound", and "Location" for each of the crime points.
```{r}
# Load required libraries
library(leaflet)
library(sf)

# Convert 'philly' data to sf object
philly_sf <- st_as_sf(philly, coords = c("point_x", "point_y"), crs = 4326)

# Define color palette for fatal and non-fatal crimes
fatal_color <- "red"
non_fatal_color <- "gold"

# Create leaflet map
map <- leaflet() %>%
  addProviderTiles(providers$Esri.WorldGrayCanvas) %>%
  addPolygons(data = phillyNeighbor,
              color = 'skyblue',
              weight = 1)  %>%
  addCircleMarkers(data = philly_sf,
                   ~point_x, ~point_y,
                   color = ifelse(philly$fatal == 1, fatal_color, non_fatal_color),
                   radius = 5,
                   popup = ~paste("Object ID: ", objectid,
                                  "<br>Year: ", year,
                                  "<br>Race: ", race,
                                  "<br>Sex: ", sex,
                                  "<br>Age: ", age,
                                  "<br>Wound: ", wound,
                                  "<br>Location: ", location),
                   labelOptions = labelOptions(
                     direction = "auto"
                   )
  ) %>%
  addLegend(
    position = "bottomright",
    colors = c("red", "gold"),
    labels = c("Fatal", "Non-Fatal"),
    title = "Crime Type"
  ) %>%
  addScaleBar() %>%
  addControl(
    html = "<h4>Philadelphia Crime Locations (2015-2024)</h4>",
    position = "topright"
  ) %>%
  addProviderTiles(providers$Esri.WorldGrayCanvas) %>%
  setView(lng = -75.1527, lat = 39.9707, zoom = 11)

# Display the map
map
```


\

\

# Working Data and Data Loading
Our initial dataset contains Presidential election results from the years 2000, 2004, 2008, 2012, 2016, and 2020. The data, called "election", has 72617 observations and 12 variables containing information for every state, county, election, candidate winner in each county, and total candidate votes. 

Some data cleaning was necessary in order to fix the issue of the county FIPS code, which represents the specific 5 digit code assigned to every county in the United States. The data set initially had some codes containing 4 digits instead of 4 if there contained a "0" as the first digit. For example, Autauga, Alabama's FIPS code "01001" is recorded as "1001" in the set. This was able to be done through the excel function "TEXT" and was done prior to reading the data in as the set called "election".

Using this dataset called "election", we want to split the data so that we have information focusing only from the county level and a dataset for the state level. Both sets include a new variable called "party_percentage" which calculates the percentage of people that picked the winning party in the sets' respective state/county - "county_data" containing election results by county and "state_data" containing election results by state. Both sets also keep only the winning party in the dataset.
```{r}
# Load the required library
library(dplyr)

# Read the data
election <- read.csv("https://ecoleman451.github.io/website/Data%20Visualization/Datasets/PresidentialElection2000To2020.csv")

# County-level Data
county_data <- election %>%
  group_by(year, state, county_name) %>%
  mutate(party_percentage = candidatevotes / sum(candidatevotes) * 100) %>%
  filter(party_percentage == max(party_percentage)) %>%
  select(year, state, county_fips, party, candidate, candidatevotes, party_percentage)

# State-level Data
state_data <- election %>%
  group_by(year, state) %>%
  mutate(party_percentage = candidatevotes / sum(candidatevotes) * 100) %>%
  filter(party_percentage == max(party_percentage)) %>%
  select(year, state, party, candidate, candidatevotes, party_percentage)

# Save county-level data to a new CSV file
write.csv(county_data, file = "county_level_data.csv", row.names = FALSE)

# Save state-level data to a new CSV file
write.csv(state_data, file = "state_level_data.csv", row.names = FALSE)
```

# Choropleth Map
Now that we've created the data set called "county_data" - focusing solely on election results (specifically the winning party) by county level, we can create an interactive Choropleth Map using the data visualization tool Tableau to display presidential election results at the county level. There are two different colors assigned to represent the major political parties (Democrat & Republican) and the map shades in every county to reflect which political party was the winner in a specific election year. A filter is set to change the year(s) in the interactive map. Hover text is also included which appears when hovering over a specific county on the map, containing information for "year", "state", "party", "candidatevotes", & "party_percentage" for the respective county being hovered over.
\
<table border = 0 bordercolor="darkgreen" bgcolor='#f6f6f6'  width=100%  align = center>
<tr>
<td>
<div class='tableauPlaceholder' id='viz1712269294020' style='position: relative'><noscript><a href='#'><img alt='Choropleth Map ' src='https:&#47;&#47;public.tableau.com&#47;static&#47;images&#47;Ch&#47;Choropleth_Map_Final&#47;Sheet1&#47;1_rss.png' style='border: none' /></a></noscript><object class='tableauViz'  style='display:none;'><param name='host_url' value='https%3A%2F%2Fpublic.tableau.com%2F' /> <param name='embed_code_version' value='3' /> <param name='site_root' value='' /><param name='name' value='Choropleth_Map_Final&#47;Sheet1' /><param name='tabs' value='no' /><param name='toolbar' value='yes' /><param name='static_image' value='https:&#47;&#47;public.tableau.com&#47;static&#47;images&#47;Ch&#47;Choropleth_Map_Final&#47;Sheet1&#47;1.png' /> <param name='animate_transition' value='yes' /><param name='display_static_image' value='yes' /><param name='display_spinner' value='yes' /><param name='display_overlay' value='yes' /><param name='display_count' value='yes' /><param name='language' value='en-US' /><param name='filter' value='publish=yes' /></object></div>                
<script type='text/javascript'>                    var divElement = document.getElementById('viz1712269294020');                    var vizElement = divElement.getElementsByTagName('object')[0];                    vizElement.style.width='100%';vizElement.style.height=(divElement.offsetWidth*0.75)+'px';                    var scriptElement = document.createElement('script');                    scriptElement.src = 'https://public.tableau.com/javascripts/api/viz_v1.js';                    vizElement.parentNode.insertBefore(scriptElement, vizElement);                
</script>
</td>
</tr>
</table>            
\